Deep Architectures for Neural Machine Translation
It has been shown that increasing model depth improves the quality of neural
machine translation. However, different architectural variants to increase
model depth have been proposed, and so far, there has been no thorough
comparative study.
In this work, we describe and evaluate several existing approaches to
introduce depth in neural machine translation. Additionally, we explore novel
architectural variants, including deep transition RNNs, and we vary how
attention is used in the deep decoder. We introduce a novel "BiDeep" RNN
architecture that combines deep transition RNNs and stacked RNNs.
Our evaluation is carried out on the English to German WMT news translation
dataset, using a single-GPU machine for both training and inference. We find
that several of our proposed architectures improve upon existing approaches in
terms of speed and translation quality. The largest gains come from a BiDeep
RNN of combined depth 8, which improves on a strong shallow baseline by 1.5
BLEU on average.
We release our code for ease of adoption.
Comment: WMT 2017 research track
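The abstract only names the BiDeep architecture, so here is a minimal, hypothetical PyTorch sketch of the core idea: stacking cells that each perform several GRU transitions per time step. The class names, the zero-input approximation for the inner transitions, and all sizes are illustrative assumptions, not the paper's released implementation.

```python
import torch
import torch.nn as nn


class DeepTransitionCell(nn.Module):
    """One recurrent step built from several GRU transitions.

    The first transition reads the external input; the remaining
    transitions refine the hidden state. Feeding them a zero input
    vector is an approximation chosen here for brevity.
    """

    def __init__(self, input_size, hidden_size, transition_depth=2):
        super().__init__()
        self.hidden_size = hidden_size
        self.first = nn.GRUCell(input_size, hidden_size)
        self.rest = nn.ModuleList(
            nn.GRUCell(hidden_size, hidden_size)
            for _ in range(transition_depth - 1)
        )

    def forward(self, x, h):
        h = self.first(x, h)
        zero = x.new_zeros(x.size(0), self.hidden_size)
        for cell in self.rest:
            h = cell(zero, h)
        return h


class BiDeepRNN(nn.Module):
    """Stack of deep-transition cells: combined depth = stack x transition."""

    def __init__(self, input_size, hidden_size,
                 stack_depth=4, transition_depth=2):
        super().__init__()
        in_sizes = [input_size] + [hidden_size] * (stack_depth - 1)
        self.layers = nn.ModuleList(
            DeepTransitionCell(s, hidden_size, transition_depth)
            for s in in_sizes
        )

    def forward(self, inputs):
        # inputs: (seq_len, batch, input_size)
        states = [inputs.new_zeros(inputs.size(1), layer.hidden_size)
                  for layer in self.layers]
        outputs = []
        for x in inputs:  # iterate over time steps
            for i, layer in enumerate(self.layers):
                x = layer(x, states[i])
                states[i] = x
            outputs.append(x)
        return torch.stack(outputs)
```

With stack_depth=4 and transition_depth=2 this gives the combined depth of 8 that the abstract reports as the best configuration.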
Surprise Language Challenge: Developing a Neural Machine Translation System between Pashto and English in Two Months
In the media industry, the focus of global reporting can shift overnight. There is a compelling need to be able to develop new machine translation systems in a short period of time, in order to more efficiently cover quickly developing stories. As part of the EU project GoURMET, which focusses on low-resource machine translation, our media partners selected a surprise language for which a machine translation system had to be built and evaluated in two months (February and March 2021). The language selected was Pashto, an Indo-Iranian language spoken in Afghanistan, Pakistan and India. In this period we completed the full pipeline of development of a neural machine translation system: data crawling, cleaning, aligning, creating test sets, developing and testing models, and delivering them to the user partners. In this paper we describe rapid data creation and experiments with transfer learning and pretraining for this low-resource language pair. We find that starting from an existing large model pre-trained on 50 languages leads to far better BLEU scores than pretraining on one high-resource language pair with a smaller model. We also present human evaluation of our systems, which indicates that the resulting systems perform better than a freely available commercial system when translating from English into Pashto, and similarly when translating from Pashto into English.
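The abstract does not name the 50-language model; a publicly available checkpoint that fits the description, and which includes Pashto (ps_AF), is mBART-50. As a hedged illustration of the transfer-learning recipe described (fine-tune a massively multilingual pretrained model on the low-resource pair), the Python sketch below uses Hugging Face transformers with facebook/mbart-large-50. The choice of checkpoint, tooling, and hyperparameters is an assumption for illustration, not the project's actual pipeline.

```python
import torch
from transformers import MBart50TokenizerFast, MBartForConditionalGeneration

# Assumed starting point: mBART-50, pre-trained on 50 languages
# (the abstract does not name the exact checkpoint used).
model_name = "facebook/mbart-large-50"
tokenizer = MBart50TokenizerFast.from_pretrained(
    model_name, src_lang="ps_AF", tgt_lang="en_XX"
)
model = MBartForConditionalGeneration.from_pretrained(model_name)
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

# One illustrative fine-tuning step on a toy Pashto-English pair;
# real training would iterate over the crawled parallel corpus.
batch = tokenizer(
    ["دا یوه بېلګه ده"],                 # Pashto source (toy example)
    text_target=["This is an example"],  # English reference
    return_tensors="pt", padding=True,
)
loss = model(**batch).loss  # labels are included in the batch
loss.backward()
optimizer.step()
optimizer.zero_grad()

# Inference: force English as the target language code when generating.
model.eval()
with torch.no_grad():
    generated = model.generate(
        **tokenizer(["دا یوه بېلګه ده"], return_tensors="pt"),
        forced_bos_token_id=tokenizer.lang_code_to_id["en_XX"],
    )
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```

Per the abstract, starting from such a multilingual model gave far better BLEU than pretraining a smaller model on a single high-resource language pair.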